NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks

Wang, Han; Wang, Gang; Zhang, Huan (June 2025, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR))

Full Text Available
Steering away from harm: An adaptive approach to defending vision language model against jailbreaks

Wang, Han; Wang, Gang; Zhang, Huan (June 2025, Proceedings of the Computer Vision and Pattern Recognition Conference)

Vision Language Models (VLMs) can produce unintended and harmful content when exposed to adversarial attacks, particularly because their vision capabilities create new vulnerabilities. Existing defenses, such as input preprocessing, adversarial training, and response evaluation-based methods, are often impractical for real-world deployment due to their high costs. To address this challenge, we propose ASTRA, an efficient and effective defense by adaptively steering models away from adversarial feature directions to resist VLM attacks. Our key procedures involve finding transferable steering vectors representing the direction of harmful response and applying adaptive activation steering to remove these directions at inference time. To create effective steering vectors, we randomly ablate the visual tokens from the adversarial images and identify those most strongly associated with jailbreaks. These tokens are then used to construct steering vectors. During inference, we perform the adaptive steering method that involves the projection between the steering vectors and calibrated activation, resulting in little performance drops on benign inputs while strongly avoiding harmful outputs under adversarial inputs. Extensive experiments across multiple models and baselines demonstrate our state-of-the-art performance and high efficiency in mitigating jailbreak risks. Additionally, ASTRA exhibits good transferability, defending against unseen attacks (ie, structured-based attack, perturbation-based attack with project gradient descent variants, and text-only attack).
more » « less
Full Text Available
The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination

Sun, Yifan; Wang, Han; Li, Dongbai; Wang, Gang; Zhang, Huan (July 2025, Proceedings of The 42nd International Conference on Machine Learning (ICML))

Full Text Available
SDP-CROWN: Efficient Bound Propagation for Neural Network Verification with Tightness of Semidefinite Programming

Chiu, Hong-Ming; Chen, Hao; Zhang, Huan; Zhang, Richard Y (July 2025, International Conference on Machine Learning 2025)

Full Text Available
Neural Network Verification with Branch-and-Bound for General Nonlinearities

Shi, Zhouxing; Jin, Qirui; Kolter, Zico; Jana, Suman; Hsieh, Cho-Jui; Zhang, Huan (May 2025, 31st International Conference on Tools and Algorithms for the Construction and Analysis of Systems)

Full Text Available
Verified Safe Reinforcement Learning for Neural Network Dynamic Models

Wu, Junlin; Zhang, Huan; Vorobeychik, Yevgeniy (December 2024, Neural Information Processing Systems)

Full Text Available
Verified Safe Reinforcement Learning for Neural Network Dynamic Models

Wu, Junlin; Zhang, Huan; Vorobeychik, Yevgeniy (December 2024, Neural Information Processing Systems)

Full Text Available
Sunlight‐sensitive carbon dots for plant immunity priming and pathogen defence

https://doi.org/10.1111/pbi.70050

Kou, Erfeng; Luo, Zhongxu; Ye, Jingyi; Chen, Xu; Lu, Dan; Landry, Markita P; Zhang, Honglu; Zhang, Huan (March 2025, Plant Biotechnology Journal)

Summary Global food production faces persistent threats from environmental challenges and pathogenic attacks, leading to significant yield losses. Conventional strategies to combat pathogens, such as fungicides and disease‐resistant breeding, are limited by environmental contamination and emergence of pathogen resistance. Herein, we engineered sunlight‐sensitive and biodegradable carbon dots (CDs) capable of generating reactive oxygen species (ROS), offering a novel and sustainable approach for plant protection. Our study demonstrates that CDs function as dual‐purpose materials: priming plant immune responses and serving as broad‐spectrum antifungal agents. Foliar application of CDs generated ROS under light, and the ROS could damage the plant cell wall and trigger cell wall‐mediated immunity. Immune activation enhanced plant resistance against pathogens without compromising photosynthetic efficiency or yield. Specifically, spray treatment with CDs at 240 mg/L (2 mL per plant) reduced the incidence of grey mould inN. benthamianaand tomato leaves by 44% and 12%, respectively, and late blight in tomato leaves by 31%. Moreover, CDs (480 mg/L, 1 mL) combined with continuous sunlight irradiation (simulated by xenon lamp, 9.4 × 10⁵lux) showed a broad‐spectrum antifungal activity. The inhibition ratios for mycelium growth were 66.5% forP. capsici, 8% forS. sclerotiorumand 100% forB. cinerea, respectively. Mechanistic studies revealed that CDs effectively inhibited mycelium growth by damaging hyphae and spore structures, thereby disrupting the propagation and vitality of pathogens. These findings suggest that CDs offer a promising, eco‐friendly strategy for sustainable crop protection, with potential for practical agricultural applications that maintain crop yields and minimize environmental impact.
more » « less
Full Text Available
Scalable Neural Network Verification with Branch-and-bound Inferred Cutting Planes

Zhou, Duo; Brix, Christopher; Hanasusanto, Grani A; Zhang, Huan (December 2024, 38th Conference on Neural Information Processing Systems)

Full Text Available
Geometric model for dynamics of motor-driven centrosomal asters

https://doi.org/10.1103/PhysRevResearch.7.013004

Young, Yuan-Nan; Herrera, Vicente Gomez; Zhang, Huan; Farhadifar, Reza; Shelley, Michael J (January 2025, Physical Review Research)

The centrosomal aster is a mobile and adaptable cellular organelle that exerts and transmits forces necessary for tasks such as nuclear migration and spindle positioning. Recent experimental and theoretical studies of nematode and human cells demonstrate that pulling forces on asters by cortically anchored force generators are dominant during such processes. Here, we present a comprehensive investigation of the S-model (S for stoichiometry) of aster dynamics based solely on such forces. The model evolves the astral centrosome position, a probability field of cell-surface motor occupancy by centrosomal microtubules (under an assumption of stoichiometric binding), and free boundaries of unattached, growing microtubules. We show how cell shape affects the stability of centering of the aster, and its transition to oscillations with increasing motor number. Seeking to understand observations in single-cell nematode embryos, we use highly accurate simulations to examine the nonlinear structures of the bifurcations, and demonstrate the importance of binding domain overlap to interpreting genetic perturbation experiments. We find a generally rich dynamical landscape, dependent upon cell shape, such as internal constant-velocity equatorial orbits of asters that can be seen as traveling wave solutions. Finally, we study the interactions of multiple asters which we demonstrate an effective mutual repulsion due to their competition for surface force generators. We find, amazingly, that centrosomes can relax onto the vertices of platonic and nonplatonic solids, very closely mirroring the results of the classical Thomson problem for energy-minimizing configurations of electrons constrained to a sphere and interacting via repulsive Coulomb potentials. Our findings both explain experimental observations, providing insights into the mechanisms governing spindle positioning and cell division dynamics, and show the possibility of new nonlinear phenomena in cell biology. Published by the American Physical Society2025
more » « less
Full Text Available

« Prev Next »

Search for: All records